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Abstract. In a multi-objective game, each agent individually evaluates each over¬ 
all action-profile on multiple objectives. I generalize the price of anarchy to multi¬ 
objective games and provide a polynomial-time algorithm to assess it 1 . 

This work asserts that policies on tobacco promote a higher economic efficiency. 

1 Introduction 

Economic agents, for each individual decision, make a trade off between multiple objec¬ 
tives, like for instance: time, resources, goods, financial income, sustainability, happi¬ 
ness and life. This motivated the introduction of a super-class of games: multi-objective 
(MO) games I5l22l . Each agent evaluates each overall action profile by a vector. His in¬ 
dividual preference is a partial rationality modelled by the Pareto-dominance. It induces 
Pareto-Nash-equilibria (PN) as the overall selfish outcomes. Furthermore, concerning 
economic models, such vectorial evaluations are a humble backtrack from the intrinsic 
and subjective theories of value, towards a non-theory of value where the evaluations 
are maintained vectorial, in order to enable partial rationalities and to avoid losses of 
information in the model. In this more realistic (behaviourally less assumptive) frame¬ 
work, in order to avoid critical losses of information on the several objectives in the 
model, thoroughly computing efficiency is a tremendous necessity Il3l23l . 

The literature on MO games is disparate and will be presented where relevant. After 
the preliminaries below, Section[2]generalizes the coordination ratio (CR, better known 
as “price of anarchy”) to MO games. Section [3] applies it to the efficiency of tobacco 
economy. Section[4]provides algorithm^] to assess the MO-CR. 

Let N = {1,..., n} denote the set of agents. Let A z denote each agent i’ s action- 
set (discrete, finite). Each agent i decides an action a 1 £ A 1 . Given a subset of agents 
M C N. let A M denote x i^M A z and let A = A N denote the set of overall action- 
profiles. Let O = {1,,..., d} denote the set of all the objectives, with d fixed. Let 
v 1 : A —> R+ denote an agent i’s individual MO evaluation function, which maps each 
overall action-profile a = (a 1 ,..., a") £ A to an MO evaluation v l (a) £ Rl. Hence, 
agent i’s evaluation for objective k is v l k {a) £ R+. Given an overall action-profile 
a £ A, a M is the restriction of a to A M , and a~ l to A N \^. 

Definition 1. A Multi-objective Game (MOG) is a tuple (AT, { A l }i e jv, O , {tP}jejv). 

For instance, MO games encompass single-objective (discrete) optimization problems, 
MO optimization problems and non-cooperative games. Assuming a = \A l \ £ N for 
each agent, the representation of an MOG requires na n d-dimensional vectors. 


1 Appendix[Ajshows that “smoothness” analysis 1201 cannot be applied to MO games. 







Let us now supply the vectors with a preference relation. Assuming a maximization 
setting, given x,y £ R'_[, the following relations state respectively that y (JTJ) weakly- 
Pareto-dominates and |2) Pareto-dominates x: 

y^x •(=> Vfc £ O, y k > x k (1) 

y R x •(=> Vfe £ O, y k > x k and 3k £ O, y k > x k (2) 


The Pareto-dominance is a partial order, inducing a multiplicity of Pareto-efficient out¬ 
comes. Formally, the set of efficient vectors is defined as follows: 

Definition 2 (Pareto-efficiency). For Y C R'_{_, the efficient vectors EFF[Y] C Y are: 

EFF[Y] ={ y * £Y \Vy£Y 7 not (y R y*)} 

(Similarly, let WST[F] = {y~ £ Y s.t. \/y £ Y, not ( y~ >- y)} denote the subset 
of worst vectors.) Pareto-efficiency enables to define as efficient all the trade-offs that 
cannot be improved on one objective without being downgraded on another one, that is: 
the best compromises between objectives (see e.g. Figure[]]>. 

At the individual scale, Pareto-efficiency defines a partial rationality , enabling to 
model behaviours that single-objective (SO) games would not model consistently. 

Definition 3 (Pareto-Nash equilibrium |22I ). In an MOG, an action-profile a £ A is 
a Pareto-Nash equilibrium (denoted by a £ PN), if and only if, for each agent i £ N: 

v i (a i , a~ i ) £ EFF[ ] 

where v I (A t ,a~ l ) denotes {v l (b l ,a~ l ) \ b 1 £ A *}. 

Pareto-Nash equilibria encompass most behaviourally possible action-profiles. For in¬ 
stance, whatever an agent’s subjective linear positive weighted combination of the ob¬ 
jectives, his decision is Pareto-efficient. One can distinguish behavioural objectives in¬ 
ducing PN and also objectives on which to focus an efficiency study. 

Equilibrium existence. In many sound probabilistic settings I17l8ll8ft . Pareto effi¬ 
ciency is not demanding on the conditions of individual rationality, hence there are 
multiple Pareto-efficient responses. Consequently, pure PN are numerous in average: 
|PN| £ 0(a < justifying their existence in a probabilistic manner. Furthermore, in 
MO games with MO potentials H 151171191 , the existence is guaranteed. 


Example 1 (A didactic toy-example in Ocean Shores). Five shops in Ocean Shores (the 
nodes) can decide upon two activities: renting bikes or buggies, selling clams or fruits, 
etc. Each agent evaluates his local action-profile depending on the actions of his inner- 
neighbours and according to two objectives: financial revenue and sustainability. 
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For instance, we have (b 1 , b 2 , a 3 , b 4 , b 5 ) £ PN, since each of these individual actions, 
given the adversary local action profile (column), is Pareto-efficient among the two 
actions of the agent (row). Even if the relative values of the objectives cannot be cer¬ 
tainly ascertained, all the subjectively efficient vectors are encompassed by the indi¬ 
vidual Pareto-efficiency. In this MO game, there are 13 Pareto-Nash-equilibria, which 
utilitarian evaluations are depicted in Figure [1] (Section [2}. 


2 The Multi-objective Coordination Ratio 


It is well known in game theory that an equilibrium can be overall inefficient with regard 
to the sum of the individual evaluations. This loss of efficiency is measured by the coor¬ 
dination raf/rj^jfCR) H3I4I61 101121201211 min[i i(PE)\/ max[u(A)]. Regrettably, when 
focusing on one sole objective (e.g. making money or a higher GDP), there are losses 
of efficiency that are not measured (e.g. non-sustainability of productions or produc¬ 
tion of addictive carcinogens). This appeals for a more thorough analysis of the loss of 
efficiency at equilibrium and the definition of a multi-objective coordination ratio. 

The utilitarian social welfare u : A —>• EV[ is a vector-valued function measuring 
social welfare with respect to the d objectives: u(a) = JT gJv tT(a), excluding the 
purely behavioural objectives that cause irrationality (23l . Given a function f : A—t Z, 
the image set f(E) of a subset E C A is defined by f{E) = {f(a)\a £ E} C Z. Given 
p,y,z £ R+, the vector p * y £ R+ is defined by Vfc £ 0,(p * y)k = PkVk and the 
vector y/z £ R(J_ is defined by Vfc £ O, ( y/z)k = Vk/zk- F°r x £ R+, x*Y denotes 
{x -k y £ R(J_ | y £ Y}. Given x £ R+, C( o 
introduce]^] the notations £ and T, illustrated in Figures gan dg 

- A = u(A) the set of outcomes. (•) 

- £ = rt(PN) the equilibria outcomes. (♦) 

- T = EFF[u(j 4)] the efficient outcomes, (x) 

For SO games, the worst-case efficiency 
of equilibria is measured by the CR 
min [u(PE)\/ max[it(A)]. However, for MO 
games, there are many equilibria and optima, 
and a ratio of the (green) set £ over the (red) set 
IF is not defined yet and ought to maintain the 
information on each objective without introducing 
dictatorial choices. 
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Fig. 1: The bi-objective utilitar¬ 
ian vectors of Ocean Shores 


I introduce a multi-objective CR. Firstly, the efficiency of one equilibrium y £ £ 
is quantified without taking side with any efficient outcome, by defining with flexibil¬ 
ity and no dictatorship, a disjunctive set of guaranteed ratios of efficiency R[y , F] = 
U z£T C(y/z). Secondly, in MOGs, in average, there are many Pareto-Nash-equilibria. 
An efficiency guarantee p £ R+, must hold for each equilibrium-outcome, inducing the 

2 As Smoothness f20l cannot be applied to MO games, I cannot use the term Price of Anarchy. 

3 To enable ratios, one can do the minor assumption F C R+ + . 





conjunctive definition of the set of guaranteed ratios R [£, F] = f) yg£ R\Vi -T}- Techni¬ 
cally, R [£, F] only depends on WST[£] and F . Finally, if two bounds on the efficiency 
p and p' are such that p >~ p', then p' brings no more information, hence, MO-CR is 
defined by using EFF on the guaranteed efficiency ratios i?[WST[£], F]. This MO-CR 
satisfies a set of key properties detailed in Appendix |B| 

Definition 4 (MO Coordination Ratio). Given an MOG, a vector p £ bounds 
the MOG’s inefficiency if and only if it holds that: \/y £ £, 3 z £ F, y/z F p. 

Consequently, the set of guaranteed ratios is defined by: 

R[£,F] = HU C(y/z) 

y€£ 


and the MO-CR is defined by: MO-CR[£, F] = EFF[R[WST [£], F)) 


Example 2 (The Efficiency ratios of Example [7}. 1 depict the efficiency ratios of Ocean 
Shores (intersected with [0, l] d ) which depend of WST[£] = {(30, 53), (40,38)} and 
F = {(46,61),..., (69,31)}. The part below the red line corresponds to f?[(30, 53), F], 
the part below the blue line to f?[(40,38), F\ and the yellow part below both lines is the 
conjunction on both equilibria f?[WST[£], F]. The freedom degree of deciding what 


the overall efficiency should be is left free (no 
dictatorship) which results in several ratios in 
the MO-CR. Firstly, for each p £ R[£,F], we 
have pi < 65%. Hence, whatever the choices 
of overall efficiency, one cannot guarantee more 
than 65% of efficiency on objective 1. Secondly, 
there are some subjectivities for which the effi¬ 
ciency on objective 2 is already total (100%, if 
not more) while situation on objective 1 is worse 
and only 50% can be obtained. Thirdly, from 
50% to 65% of subjective efficiency on objec¬ 
tive 1, the various subjectivities range the effi¬ 
ciency on objective 2 from 100% to 75%. 
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Fig. 2: The MO-CR of Ocean Shores 




T : efficient outcomes 


Having p in MO-CR means that for each 
y £ £, there is an efficient outcome z^ £ 
F such that y dominates p * z^ v \ In other 
words, if p £ R[£, F], then each equilib¬ 
rium satisfies the ratio of efficiency p. This 
means that equilibria-outcomes are at least 
as good as p*F. That is: £ C (p*F) +R^. 
Moreover, since p is tight, £ sticks to p-k F. 


Fig. 3: p £ MO-CR bounds below £’s inefficiency: £ C (p * F) + 













3 Application to Tobacco Economy 


Tobacco consumption is a striking example of economic inefficiency induced by 
bounded rationalities. According to the World Health Organisation |[25l . 17.000 hu¬ 
mans die each day of smoking related diseases (one person per 5 seconds). Meanwhile, 
addictive satisfaction and the financial revenue of the tobacco industry fosters consump¬ 
tion and production. According to the subjective theory of value li24l . some economists 
would say: “Since consumers value the product, then the industry creates value.” Ac¬ 
cording to other health economists ll23l . most consumers become addict before age 18, 
and as adults, would prefer a healthier life, but fail to opt-out. 

The theory of MO games, based on a non-theory of value, just maintains vectorial 
evaluations and properly considers dollars, addiction and life expectancy as distinct 
objectives, with PN equilibria encompassing the relevant behaviours, even irrational. 
We modelled the tobacco industry and its consumers mm by a succinct MOG, 
with the help of (..) the associatiotj^ “ Alliance contre le tabac”. The set of agents is 
N = {industry, v consumers}, where there are about v = 6.10 9 prospective con¬ 
sumers. Each consumer decides in .4 c "" sulncr = {not-smoking, smoking} and cares 
about money, his addictive pleasure, and living. The industry only cares about money 
and decides in A lndustry = {not-active, active, advertise&active}. We have O = 
{money, reward, life-expectancy}. The tables below depict the evaluation vectors (over 
a life-time and ordered as in O ) of one prospective consumer and the evaluations of the 
industry with respect to the number 6 £ {0,..., v} of consumers who decide to smoke. 
The money budget (already an aggregation) is expressed in kilo-dollanfj|~| the addictive 
reward is on an ordinal scale {1, 2,3,4}; life-expectancy is in years. 


^consumer 

not-active active 

advertise&active 

not-smoking 

smoking 

(48,1,75) (48,1,75) 
(48,1,75) (12,3,65) 

(48,1,75) 

(0,4,55) 


^industry 

not-active active advertise&active 

(y — 9)x 
+ 9 x 

(0,-,-) (0,-,-) (0,-,-) 

(0,-,-) (26,-,-) (36,-,-) 


Pareto-Nash equilibria. If the industry is active, then for the consumer, deciding to 
smoke or not depends on how the consumer subjectively values/weighs money, addic¬ 
tion and life expectancy: both decisions are encompassed by Pareto-efficiency. For the 
industry, advertise&active is a dominant strategy. Consequently, Pareto-Nash-equilibria 
are all the action-profiles in which the industry decides advertise&active. 

Efficiency. Since addiction is irrational (detailed in Appendix [C|, I focus on money 
and life-expectancy. We have £ = {0(36,55) + (v — 0)(48,75) | 0 < 9 < v} and 

4 I am grateful to Clemence Cagnat-Lardeau for her help on modelling tobacco economy. 

5 Note that most states set the prices of tobacco, hence prices do not follow supply/demand. 

6 These numbers differ from 1231 which aggregates everything (e.g. life expectancy) into money. 









T = {zx(48, 75)}, where v is the world’s population, and 9 the number of smokers. 
Since WST[£] = {(36, 55)}, the MO-CR is the singleton {(75%, 73%)}: in the worst 
case, we lose 12k$ and 20 years of life-expectancy per-consumer. These Pareto-Nash- 
equilibria are the worst action-profiles for money and life-expectancy, a critical infor¬ 
mation that was not lost by this MOG and its MO-CR. 

Practical lessons. Advertising tobacco fosters consumption. The association “ Alliance 
contre le tabac ” passed a law for standardized neutral packets (April 3rd 2015), in order 
to annihilate all the benefits of branding, but only in France. The model indicates that: 

This law will promote a higher economic efficiency. 


4 Computation of the MO-CR 

In this section, I provide a polynomial-time algorithm for the computation of MO-CR 
which relies on a very general procedure based on two phases: 

1. Given a MOG, compute the worst equilibria WST[£] and the efficient outcomes T. 

2. Given WST[£] and T, compute MO-CR = EFF[ R[ WST[£] , T} ]. 

Depending on the input (normal form or compact representation), it adapts as follows. 

4.1 Computation of the MO-CR for Multi-objective Normal Forms 

For a MOG given in MO normal form (which representation length is L = na n d ), 
Phase 1 (computing WST[£] and T) is easy and takes time 0(L 2 ) (see Appendix [d]). 
For d = 2, this lowers to 0(Llog 2 (L)). Let us denote the sizes of the outputs q = 
|WST[£]| and m = \IF\. For normal forms, it holds that q,m = 0(|.A|) = O(L). 

For Phase 2, at first glance, the development of the intersection of unions 
f?[WST[£], JF\ = n y6W sT[£] U 2S jf C(y/z) causes an exponential m q . But fortunately, 
one can compute the MO-CR in polynomial time. Below, D 1 is a set of vectors. Given 
two vectors x, y £ R+, let x A y denote the vector defined by Vfc GO, (x A y)k = 
min {xk,yk} and recall that Vfc G O, ( x/y)k = Xk/yk- Algorithm [l] is the develop¬ 
ment of n ye wsT[£] C(y/z), on a set-algebra of cone-unions. Appendix |d| shows 

that Algorithmflltakes time 0((qm) 2d ~ 1 d), or 0((qm) 2 log 2 (gm)) for d = 2. 


Algorithm 1: Computing MO-CR in polynomial-time 0((qm) 2d 1 d.) 

Input: WST[£] = {y 1 ,. .. , y q } and T = {z 1 ,..., z m j 
Output: MO-CR = EFF(R[WST[£], F\] 

create D 1 t— {y 1 /z G R+ | z G T} 
for t = 2,..., q do 

D* t- EFF {{p A tf/z) \ p G D*~ x , z G T}] 

end 

return D q 


Having specified Phase 1 and 2 for normal forms. Theorem 1 follows: 

Theorem 1 (Computation of MO-CR). Given a MO normal form, one can compute 
the MO-CR in polynomial time 0(L 4d-2 ). If d = 2, it lowers to 0(L 4 log 2 (L)). 





4.2 Computation of the MO-CR for Multi-objective Compact Representations 

Compact representations of massively multi-agent games (e.g. MO graphical games, 
MO action-graph games) have a representation length L that is polynomial with respect 
to the number of agents n and the sizes of the action-sets a. As q = |WST[£]| and 
m = | E\ can be exponentials a n of this representation length, compact representations 
are algorithmically more challenging, leaving open the computation of WST[£] and E 
in Phase 1, and complicating the use of Algorithm |T| in Phase 2. To overcome this, 
one can do MO approximations 1(161 . by implementing an approximate Phase 1 which 
precision transfers to Phase 2 in polynomial time, as follows. 

Lemma 1. Given £i, £2 > 0 and approximations E of £ and F of E in the sense that: 

Vy e £,3 y' £ E, y>zy' and Vy' £ E, By £ £, (1 + £1 )y' £ y (3) 

Vz' £ F,3z £ E, 7 ? £3 z and Vz £ E, 3 z’ £ F, (1 + £ 2 )z £3 z! (4) 

it holds that R[E, f] C R [£, E] and: 

fp£R[£,E},3p’ £R[E,F], (l + ei )(l +s 2 )p'tP (5) 

Equations ([ 3 ]) and 0 state approximation bounds. Equations 0 state that (1 + E\)~ 1 £ 
bounds below E which bounds below £. Equations Q state that E bounds below F 
which bounds below (1 + ef)E. Crucially, whatever the sizes of £ and E, there exist 
such approximations E and F that are 0((l/£i) d_1 ) and 0((l/£ 2 ) d_1 ) sized lfl 6 fl . 
yielding the approximation scheme below. 

Theorem 2 (Approximation Scheme for MO-CR). Given a compact MOG of repre¬ 
sentation length L, precisions £1 , £2 > 0 and two algorithms to compute approxima¬ 
tions E of £ and F of E in the sense of Equations 0 and 0 that take time 0f(£i, L ) 
and 9jr(e 2 , L), one can approximate R[£, E] in the sense of Equation 0 in time: 

o{e £ {s i,L) + 9jr(e 2 ,L) + (E 1 £ 2 y (d ~ 1){2d ~ 1) ^ 

For MO graphical games. Phase 1 could be instantiated with approximate junction-tree 
algorithms on MO graphical models (9|. For MO symmetric action-graph games, in 
the same fashion, one could generalize existing algorithms ED- More generally, for 
WST[£] and E, one can also use meta-heuristics with experimental guarantees. 

5 Prospects 

Multi-objective games can be used as a behaviourally more realistic framework to 
model a wide set of games occurring in business situations ranging from carpooling 
websites to combinatorial auctions. Also, studying the efficiency of MO generalizations 
of routing or Cournot-competitions D 1 could provide realistic economic insights. 
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A Why Smoothness will not work on multi-objective games 


Most single-objective price of anarchy analytic results rely on a smoothness-analysis 
ll20l . A crucial step for “smoothness” is to sum the best response inequalities: For 
a single-objective game and an equilibrium a £ PN, from the best-response con¬ 
ditions Vi £ N,Mb l £ A l ,v % (a) > v l (b l , a~ l ), one has: Mb £ A.J^i = iV l (a) > 
1 (b 1, , a ~ l ). However, the Pareto-Nash-equilibrium conditions are rather: Mi £ 

TV, W £ A 1 , v l (b x ,a~ l ) )/- v l (a). As shown in the following counter-example, such )/- 
relations cannot be summed: 



Consequently, Smoothness-analysis does not encompass Pareto-Nash equilibria, re¬ 
gardless of the efficiency measurement chosen. 


B Properties of the Multi-objective Coordination Ratio 

The Multi-objective Coordination Ratio fulfils a list of key good properties for the thor¬ 
ough measurement of the multi-objective efficiency of MO games. 

B.l Worst case guarantee on equilibria outcomes 
and No dictatorship on efficient outcomes 

Each vectorial efficiency ratio that the MO-CR states, bounds below the efficiency for 
each equilibrium outcome, compared to an existing efficient outcome: 

Mp £ MO-CR[£, T], My £ £, 3z £ T, y/z>zp 

The process of measuring efficiency by MO-CR does not imply any choice in T that 
would impose a point-of-view telling what efficiency should be (e.g. no five-year plans). 

B.2 Multi-objective ratio-scale 

Given £, T and r £ M+ + , it holds that: 


MO-CR [f, T] 

C 

(6) 

MO-CR[{(0,..., 0)}, J 7 ] 

= {(0.0)} 

(7) 

MO-CR[r*£, T] 

= r * MO-CR [£,F\ 

(8) 

MO-CR [£, r * J 7 ] 

= MO-CR [f, F\/r 

(9) 

£ C T 

& (1,...,1) £MO-CR[f,J"] 

(10) 


Equation ([6]» states that the MO-CR is expressed in the multi-objective space. It is worth 
noting that while MO-CR[£, T] C [0, l] d is a more classical choice, MO-CR also al¬ 
lows for measurements of over-efficiencies. (E.g. if T is a family-car and £ is a Lam¬ 
borghini, then there is over-efficiency on the speed objective.) 


Equations (|7j, (|8]i and ([9]) state that MO-CR is sensitive on each objective to multi¬ 
plications of the outcomes. For instance, if £ is three times better on objective k, then 
so is MO-CR. If there are twice better opportunities of efficiency in F on objective k', 
then MO-CR is one half on objective k!. In other words, the efficiency of each objective 
independently reflects into the MO-CR in a ratio-scale. 

If all equilibria outcomes are efficient (i.e. £ C F), then this must imply 
that according to the MO-CR, the MO game is fully efficient, that is: (1,..., 1) £ 
MO-CR \£, F], The MO-CR seems to be the only multi-objective ratio-scale measure¬ 
ment that fulfils Equation ( |T()| while being a worst case guarantee on equilibria out¬ 
comes with no dictatorship on what efficiency should be. 

It is also worth noting that MO-CR is MO-monotonic with respect to £ and F. For 
A', Y C R^_, let X >Y denote Y C C(X) where C(X) = U x£ xC(x) (i.e. X dominates 
Y). Then it holds that: 


£\>£' => MO-CR[£, F] > MO-CR[£' F] 

F\>F' => MO-CR[£, F'} > MO-CR[£, F] 


( 11 ) 

( 12 ) 


C Bounded Rationality in Tobacco Consumption 

According to the intrinsic theory of value a the value of a cigarette objectively 
amounts to the quantities of raw materials used for its production, or is the combi¬ 
nation of the labour times put into it lfj~4ll . However, each economic agent needs to 
keep the freedom to evaluate and act how he pleases, in order to keep his good will 
and some economic efficiency, as observed in the end of the Soviet Union. According 
to the subjective theory of value El, the value of a cigarette amounts to the price an 
agent is willing to pay for it. Since the consumers value the product, then the industry 
creates value l24l . However, this disregards what the disastrous consequence is on life 
expectancy, belittles 7.500.000 deaths-per-year and emphasizes the bounded rationality 
of behaviours. While for some health economists, consuming a cigarette is a rational 
choice, as one values pleasure more than life expectancy, for others, consumers are 
stuck into addiction before becoming adults. The truth is likely between these two ex¬ 
treme points of view l23l : Economic agents discount the future at a rate of 6% per-year, 
hence a day of life in 40 years is valued 10 times less than now, leading to overweight¬ 
ing the actual smoking pleasure and to irrational behaviours with respect to preferences 
over a full lifetime. Agents behave according to objectives (e.g. addictive satisfaction) 
that they would avoid if they had the full experience of their lifetime (e.g. a lung cancer 
with probability 1/2) and a sufficient will (e.g. quit smoking). Time discounting also 
explains other non-sustainable behaviours like over-fishing catastrophes. 

D Proofs 

D.l Phase 1 for Normal Forms, Correctness of Algorithm [Tjand Theorem [1] 

Phase 1 is easy, if the MOG is given in normal form. The MOG is made of the MO 
evaluations of each agent on each action-profile, that is: 0(na n ) vectors. Hence, the 


computation of u(A) requires for each a G A, the addition of n vectors. There¬ 
fore, the computation of u(A) takes time 0{a n nd) (linear in the size of the in¬ 
put) and yields 0(a n ) vectors. The computation of T = EFF[it(A)] given u(A) 
takes time 0(|u(A)| 2 <i) = 0(a 2n d). To conclude, the computation of T takes time 
0(na n d + a 2n d), which is polynomial (quadratic) in the size of the input. If d = 2, 
this can be significantly lowered to 0 (na n d + a n log 2 (a")d) = 0 {na n log 2 (a)). 

The computation of WST[£] can be achieved by first computing PN. For this 
purpose, for each agent i G N and each adversary action profile a~ l G A~\ 
one has to compute which individual actions give a Pareto-efficient evaluation in 
v t (A l ,a~ l ), in order to mark which action-profiles can be a PN from i’s point of 
view. Overall, computing PN takes time 0(na n ~ 1 a 2 d). Using back u(A), computing 
£ = it (PN) is straightforward. Again, the computation of WST[£] given £ takes 
time 0(\£\ 2 d) = 0(a 2n d). To sum up, the computation of WST[£] takes time 
0 (na n+1 d + a 2n d). ltd = 2, this lowers to 0 (na n log 2 (a)). 

In order to compute MO-CR = EFF[.R[WST[£], J 7 ]], let us study the structure of 
f\ewsT[£] UzeJF^(y/~)’ by restricting a set-algebra to the following objects: 

Definition 5 (Cone-Union). For a set of vectors X C M'_(, the Cone-Union C(X) is: 

C(X) = |J C(x) ={yGR d + | 3xGX,xtu} 

x£X 

Let C denote the set of all cone-unions ofWf 

To define an algebra on C, one can supply C with U and (T. 

Lemma 2 (On the Set-Algebra (C, U, (T)). 

Given two descriptions of cone-unions X 1 , X 2 C we have: 

CiX 1 ) U C(X 2 ) = CiX 1 U X 2 ) 

Given two descriptions of cones x 1 ,x 2 G EH , we have: 

C(x x ) (T C(x 2 ) = C(x 1 A x 2 ) 

where x 1 A x 2 G M+ is: V/c G O, (x 1 A x 2 )k = rninja:)., x 2 }. 

Given two descriptions of cone-unions X 1 , X 2 C we have: 

cix 1 ) nc{x 2 ) = (u x i eJC i cix 1 )) n ( u X 2 eX 2C{x 2 )) 

= c(x 1 ) nc(x 2 ) 

(x 1 ,x 2 )ex 1 xx 2 
= C(x 1 Ax 2 ) 

(x 1 ,x 2 )ex 1 xx 2 

= C(X 1 A X 2 ) 

where X 1 A X 2 = {x l A a; 2 | x l G X 1 , x 2 G X 2 } C R+. 

Therefore, (C, U, (T) is stable, and then is a set-algebra. 



Proof. The three properties derive from set calculus. 


The main consequence of Lemma[2]is that f?[WST[£], F) = n, y cwsr[f] U 2 EF C(y/z) 
is a cone-union. Moreover, one can do the development for n ygW sT[£] U 2g jr C{y/z) 
within the cone-unions, using distributions and developments. 

Remark 1. For a finite set X C we have: C(X) = C(EFF[X]). 

Proof. Firstly, we prove C(X) C C(EFF[X]). If y £ C(X), then there exists x £ X 
such that x £3 y. There are two cases, x £ FEE [A'] and x f EFF[X]. If x £ FEE [A - ], 
then y £ C(EFF[X]), by definition of a cone-union. Otherwise, if x f EFF[X], then 
there exists z £ X such that z y x. And since X is finite, we can find such a z in 
EFF[X], by iteratively taking z' y z until z £ EFF[X], which will happen because 
X is finite and y is transitive and irreflexive. Hence, there exists z £ EFF[X] such 
that 2 y x £ y and then 2 £3 y. Consequently, y £ C(EFF[X]), by definition of a 
cone-union. 

Conversely, Y C X => C(Y) C C(X) proves C(EFF[X]) C C(X). 

As a consequence of Remark [l] for x £ a simple cone C(x) is fully described by 
its summit x. The main consequence of this remark is that C(X) can be fully described 
and represented by EFF[X], For instance, since /I’jWSTlA], T] is a cone-union (thanks 
to Lemma [2|, and since MO-CR = EFF[f?[WST[£], T}} (by definition of the MO-CR), 
then i?[WST[£], F] is fully represented (as a cone-union) by the MO-CR, which means 
that f?[WST[£], F] = C (MO-CR). 

Recall that q = |WST[£]| and m = |X|. In this subsection, we also denote WST[£] = 
{y 1 ,..., y q } and F = { 2 1 ,..., z m j. Let A™ denote the set of functions n from 
{1,..., < 7 } to {1,, to}. (We have: = to 9 .) 

Corollary 1 (The cone-union of MO-CR). 

Given WST[£] = { y 1 ,..., y q } and F = { 2 1 ,..., z m }, we have: 


R[WST[£\,F\= |J p| Ctf/z*®) 


7TG A™ t= 1 


and therefore: 



Proof. For the first statement, just think to a development. We write down 
f?[WST[£], F] = (Tj /g wsT[£] U Z £jr C{y/z) into the layers just below. There is one layer 
per y 1 inWST[£] = {y 1 ,.. -, y\ ■ ■ ■, y q }- 


(C(£)UC(£)U...UC(£)) layer 1 
n(C(^)UC(g)U...UC(^))layer2 


fl (C(^)UC(^)U...UC(^)) layer q 




Imagine the simple cones as vertices and imagine edges going from each vertex of 
layer t to each vertex of the next layer (t + 1). The development into a union outputs 
as many intersection-terms as paths from the first layer to the last one. Let the function 
7 r : {1 ,..., g} —» {1,..., m} denote a path from layer 1 to layer q , where n (t) is the 
vertex chosen in layer t. Consequently, in the result of the development into an union, 
each term is an intersection n*=i C(y t /z^^). 

The second statement results from the first statement. Lemma [2] and Remark |T| 

_R[WST[£], J 7 ] = |J f )C(y t /z^) 

nEA™ t=t 

= U c(A 

nEA™ \t = 1 / 

= c{\^\y t /z^ 7T £ a:; 

That R[WST[£], T) = C (MO-CR) concludes the proof. 

Ultimately, this proves the correctness of Algorithm [I] for the computation of MO- 
CR, given WST[£] = {y 1 ,..., y q } and T = {z 1 ,..., z m }. It consists in the iterative 
development of the intersection R(£, T), which can be seen as dynamic programming 
on the paths of the layer graph. For fc € {1..... r/ }, we denote D t the description of the 
cone-union corresponding to the intersection: 

C(£> t ) = nf =1 U ze rC{y l /z) 

Recursively, for t > 1, C(D 4 ) = C{D t ~ 1 ) fl (U- e jr C(y t /z)). From Lemma|2]and 
Remark[I] in order to develop, we then have to iterate the following: 

D t = EFF[{p A {y*/z) \ p £ D*" 1 , « £ F}\ 

We now proceed with the time complexity of Algorithm [T] At first glance, since there 
are m q paths in the layer graph, then there are 0(m q ) elements in MO-CR. Fortunately, 
they are much less, because we have: 

Theorem 3 (MO-CR is polynomially-sized). 

Given a MOG and denoting d = \0\, q = |WST[£]| and m = jJ 7 !, we have: 

\MO-CR\ < ( qm) d_1 

Proof. Given p £ MO-CR, for some t r £ A™, we have p = At=i y t /z’"^ t \ and 

then Vfc £ 0,pk = min i= i ...q{y\/ Therefore, pk is exactly realized by the 
fcth component of at least one cone summit y t /z*^ in the layer graph (that is a vertex 
in the layer-graph above). Consequently, there are at most as many possible values for 
the fcth component of p, as the number of vertices in the layer graph, that is qm. This 
holds for the d components of p; hence there are at most ( qm) d vectors in MO-CR. 
More precisely, by Lemma[3](below), since MO-CR is an efficient set, then there are at 
most (gm) d_1 vectors in MO-CR. 



Lemma 3. Let Y C be a set of vectors, with at most M values on each component: 

I EFF[Y} | < M d_1 

Proof. At most M d_1 valuations are realized on the d - 1 first components. If you fix 
the d— 1 first components, there is at most one Pareto-efficient vector which maximizes 
the last component. 

In Algorithm [I] there are 0{q) steps. At each step t, from Theorem [5] we know that 
|.D <_1 | < (gm) d_1 . Hence, \{p A (y t / z) \ p £ D t_1 , z £ J 7 }! < q d ~ 1 m d , and the 
computation of the efficient set D t requires time 0((q d ~ l m d ) 2 d). However, by using an 
insertion process, since there are at most B = |_D*| < (qm) d ~ l Pareto-efficient vectors 
at each insertion, then we only need 0(q d ~ 1 m d x ( qm ) d-1 ) Pareto-comparisons. If 
d = 2, time lowers to Q(q d ~ 1 m d lo g 9 (qm)d 2 ) = 0(qm 2 log 2 (<?TO)). 

Ultimately, Algorithm [I] takes q steps and then time 0(q(q d ~ l m d )(qm) d ~ 1 d) = 
0{{qm) 2d ~ 1 d). If d = 2, this lowers to 0((qm) 2 log 2 (qm)). 

D.2 Approximations: Proof of Lemma [l] and Theorem [2] 

Proof (1) First, let us show R[E,F] C i?[WST[£], J 7 ]. Let p' be a ratio of R[E,F] 
and let us show that: 


\/y £ WST[f], 3z £ F, s.t.: y >Z p'* z 

Take y £ WST[£]. From Equation ([3|> (first condition), there is a y' £ E such that 
y A y'. From Definition [4] there is a z' such that y' ^ p' * z'. From Equation Q on z' 
(first condition), there exists z £ T such that z' ^3 2 . Recap: y £3 y' £3 p' * z! ^3 p' * z. 
(2) Then, let p be a ratio of f?[WST[£], J 7 ], and let us show that p' = (1 + £i) -1 (l + 
£2 ) _ V is in R[E, F], that is: 

Vj/ £ E, 3z' £ F, (1 + £i)y’ >3 (1 + £ 2 )~ 1 p*z' 

Take an element y’ of E. From Equation (f3l) (second condition), there is y £ WST[T] 
such that Q + £1 )y' ^3 y. From Definition^] there is z £ T such that V^P*z. From 
Equation (Bb on 2 (second condition), there exists z' £ F s.t. z f (1 -h £ 2 )~ 1 z'. Recap: 
(1 + £ 1 ) 2 / >yit>z p*z>y (1 + £ 2 

Proof (Theorem |2|. For the first claim, since Algorithm |T| given E and F, outputs 
the MO-PoA corresponding to R[E, F], by Theorem|2] Algorithm [I] outputs an ((1 + 
£i)(l + £ 2 ))-covering of f?(WST[£], F). 

For the second claim, from Lemma [T| applying Algorithm |T| on E and F 
outputs an ((1 + £i)(l + £ 2 ))-covering of i?(WST[£], F). Moreover, since we 
have \E\ = 0((\/ei) d ~ l ) and |F| = 0 ((l/£ 2 ) d_1 ), Algorithm [l] takes time 
0(d/(e 1 e 2 ) ( ' d - 1){2d - 1) ). 


